Document processing device, document processing method, and recording medium

ABSTRACT

A document processing device comprises: a document data acquiring part for acquiring document data; a character string extracting part for extracting character strings satisfying a predetermined condition for character string extraction from the document data acquired by the document data acquiring part; a format creating part for deriving the respective features of the character strings extracted by the character string extracting part, and for creating a format containing the derived features in the form of data; a display part on which the character strings extracted by the character string extracting part are displayed in a list form, and on which the format created by the format creating part is displayed; and a format correcting part for correcting the format displayed on the display part. The character string extracting part extracts character strings again to conform to the format corrected by the format correcting part.

This application is based on the application No. 2009-183445 filed in Japan, the contents of which are hereby incorporated by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a document processing device, a document processing method, and a recording medium. The present invention more specifically relates to a technique of extracting character strings such as a heading from a document in electronic form.

2. Description of the Background Art

Image processing devices identified by names such as complex devices or MFPs (multifunction peripherals) are capable of converting document data acquired by scanning an original into a certain file format such as PDF (portable document format), and outputting the converted document data. For such data output, character strings such as a title and a heading of each chapter and each item contained in the document may be extracted, and the document data may be output with the extracted character strings added thereto as bookmark data. This enhances the convenience in the use of document data.

An example of such a conventional technique of automatically extracting character strings is disclosed in Japanese Patent Application Laid-Open No. JP 2008-305088 A. In this conventional technique, content regions such as character strings contained in a document are extracted by rows from an image of the document, and all the extracted content regions are classified into at least one group. Then, based on the respective positions of the content regions of each classified group in the document image, the suitability of the group as a bookmark is evaluated. Based on a result of the evaluation, at least one group having the highest level of suitability is selected as a group to be a target of generation of bookmark data. Thereafter, based on the attribute information of content regions of the selected group, bookmark data indicating the respective positions of the content regions of the selected group in the document image is generated. So, this conventional technique is capable of automatically extracting character strings such as headings contained in a document.

According to a conventional technique disclosed for example in Japanese Patent Application Laid-Open No. JP 2008-305089 A, generation of bookmark data allows a user to easily find the respective positions of document contents throughout the document and the respective types of the document contents.

In the above-described conventional techniques, character strings such as headings contained in a document are extracted under a predetermined condition. This may result in extraction of a character string that fails to satisfy a user. To avoid extraction of such an unintended character string, the condition under which character strings are extracted from document data needs to be corrected. However, the conventional techniques fail to provide an efficient way of correcting the condition.

An optimum condition differs for each type of document depending on its respective setting, such as a documentary form. At the same time, a user can freely make the setting of a document such as a documentary form, so it is difficult to define in advance an optimum condition that matches all documentary forms. For this reason, when a character string failing to satisfy a user is extracted as a bookmark, it is desirable that the condition be correctable with a relatively simple operation.

SUMMARY OF THE INVENTION

The present invention has been made to solve the above-described problems. An object of the present invention is to provide a document processing device, a document processing method, and a recording medium that are capable of efficiently correcting a result of extraction when a character string failing to satisfy a user's intention is extracted as a result of extraction of character strings such as headings from a document to be processed.

First, the present invention is directed to a document processing device. According to an aspect of the document processing device, the device comprises: a document data acquiring part for acquiring document data; a character string extracting part for extracting character strings satisfying a predetermined condition for character string extraction from the document data acquired by the document data acquiring part; a format creating part for deriving the respective features of the character strings extracted by the character string extracting part, and for creating a format containing the derived features in the form of data; a display part on which the character strings extracted by the character string extracting part are displayed in a list form, and on which the format created by the format creating part is displayed; and a format correcting part for correcting the format. The character string extracting part extracts character strings again to conform to the format corrected by the format correcting part.

Second, the present invention is directed to a document processing method. According to an aspect of the document processing method, the method comprises the steps of: (a) acquiring document data; (b) extracting character strings satisfying a predetermined condition for character string extraction from the acquired document data; (c) deriving the respective features of the extracted character strings, and creating a format containing the derived features in the form of data; (d) displaying the extracted character strings in a list form, and displaying the format; (e) correcting the format; and (f) when the format is corrected, extracting character strings again to conform to the corrected format.

Third, the present invention is directed to a computer-readable recording medium on which a program is recorded. According to an aspect of the computer-readable recording medium, the program recorded on the recording medium causes a computer to operate as a system comprising: a document data acquiring part for acquiring document data; a character string extracting part for extracting character strings satisfying a predetermined condition for character string extraction from the document data acquired by the document data acquiring part; a format creating part for deriving the respective features of the character strings extracted by the character string extracting part, and for creating a format containing the derived features in the form of data; a display part on which the character strings extracted by the character string extracting part are displayed in a list form, and on which the format created by the format creating part is displayed; and a format correcting part for correcting the format. The program also causes the character string extracting part to extract character strings again to conform to the format corrected by the format correcting part.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows an exemplary configuration of a document processing system according to a preferred embodiment of the present invention;

FIG. 2 is a block diagram showing an exemplary hardware configuration of an image processing device functioning as a document processing device;

FIG. 3 shows functional blocks in a controller when the image processing device functions as a document processing device;

FIG. 4 is a block diagram showing in detail an exemplary structure of a document data processing part;

FIG. 5 shows the concept of a format containing, in the form of data, the respective features of character strings that are extracted as a bookmark;

FIG. 6 shows an example of a screen relating to bookmark extraction displayed on a display device;

FIGS. 7 to 10 each show another example of a screen relating to bookmark extraction displayed on the display device;

FIG. 11 is a flow diagram explaining an exemplary operation of the image processing device;

FIG. 12 is a flow diagram explaining in detail an exemplary procedure of a bookmark process;

FIG. 13 is a flow diagram explaining in detail an exemplary procedure of a character string extracting process;

FIG. 14 is a flow diagram explaining in detail an exemplary procedure of a format editing process;

FIG. 15 shows an example of a display screen displayed on a display unit of an operational panel; and

FIGS. 16, 17, 18A and 18B, and 19A and 19B each show an example of a display screen displayed on the display unit of the operational panel.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

A preferred embodiment of the present invention is described in detail below with reference to figures. In the description given below, elements shared in common in the preferred embodiment are represented by the same reference numerals, and the description of these elements is not repeated.

FIG. 1 shows an exemplary configuration of a document processing system according to a preferred embodiment of the present invention. The document processing system includes: an image processing device 1 such as an MFP; a computer 3 formed from a generally-used personal computer or the like; and a server 4 having server functions such as a mail server or a data server. These components are connected to each other through a network 9 such as a LAN in a manner that allows data communication therebetween. An external network such as the Internet may be connected to the network 9.

In the preferred embodiment, the image processing device 1 has a function of a document processing device. The document processing device of the present invention may be applied not only to the image processing device 1, but also to the computer 3, the server 4, or other devices.

The image processing device 1 shown in FIG. 1 has several functions including a scan function, a BOX function, a print function, and a copy function. The scan function is intended to read originals to generate image data, and save the generated image data to a storage device 36 (see FIG. 2) in the image processing device 1, or transmit the generated image data through the network 9 to a destination such as the computer 3 or the server 4. The BOX function is intended to perform processing on document data stored in a BOX of the image processing device 1. As an example, with the BOX function, document data stored in the BOX is read, and the read document data is transmitted to a destination such as the computer 3 or the server 4. In the preferred embodiment, the BOX corresponds to a storage region 38 (see FIG. 2) for document data storage provided in the storage device 36. The print function is intended to read document data received from a source such as the computer 3 through the network 9 or document data stored in the BOX, and produce printed outputs. The copy function is intended to read originals, and produce copy outputs.

As shown in FIG. 1, the image processing device 1 includes a scanner unit 13 for reading originals provided at the upper part of a device body 1a. The scanner unit 13 includes an image reader 11 for reading the image of each page of originals one by one, and an automatic document feeder (ADF) 12 from which one page is fed, or multiple pages of an original are fed one by one, to the image reader 11. This allows the scanner unit 13 to realize continuous and automatic reading of the multiple pages of the original, for example. An image forming unit 14 for forming images on output media such as sheets is provided at the central part of the device body 1a to produce printed outputs and copy outputs. The image processing device 1 also includes an operational panel 17 arranged on the front side of the scanner unit 13 and operable by a user. The operational panel 17 includes a display unit 15 on which various types of information are displayed to a user, and operation keys 16 operated by the user. The operation keys 16 include push-button keys arranged around the display unit 15, and touch panel keys arranged on the screen of the display unit 15.

The image processing device 1 also includes a display device 20 supported by an upwardly extending support provided on the back side of the device body 1a, and a mouse 21 arranged on a working table 19 attached to a lateral side of the device body 1a. The display device 20 is a display unit on which images can be displayed in a resolution higher than that of images displayed on the display unit 15 of the operational panel 17. The mouse 21 is used together with the operation keys 16 of the operational panel 17 to make entries to the image processing device 1. In the preferred embodiment, the mouse 21 is used to operate a mouse pointer displayed on the display device 20.

In the image processing device 1 of the above-described structure, document data is acquired by using the scan function or the BOX function. When bookmark extraction is ordered by a user, bookmark data is generated by extracting character strings such as headings based on the acquired document data, and the generated bookmark data is added to the document data. The image processing device 1 is described in more detail below.

FIG. 2 is a block diagram showing an exemplary hardware configuration of the image processing device 1. As shown in FIG. 2, the image processing device 1 includes, in addition to the above-described components, a controller 30 for controlling each part, an external interface 33 to which the display device 20 and the mouse 21 are connected, a network interface 34 to which the network 9 is connected, an image memory 35 in which data such as image data read from originals by the scanner unit 13 are temporarily stored, and the storage device 36 formed from a nonvolatile storage element such as a hard disk drive.

The controller 30 includes a CPU 31 and a memory 32. The CPU 31 reads and executes a program 37 stored in the storage device 36, so that the CPU 31 becomes operative to control each part. The memory 32 stores data therein such as temporary data when the CPU 31 executes the program 37. Execution of the program 37 by the CPU 31 causes the controller 30 to perform various processing. Especially in the preferred embodiment, the controller 30 is responsible for principal processing to be performed in a document processing device. More specifically, the controller 30 acquires document data, and extracts, as a bookmark, character strings that satisfy a predetermined condition (condition for character string extraction) under which character strings are to be extracted from the acquired document data. Then, the controller 30 derives the respective features of the extracted character strings, and creates a format 39 containing the derived features in the form of data. The details of the format 39 are described later. Further, the controller 30 displays the character strings extracted from the document data on the display device 20, by which a user is allowed to see whether or not the character strings extracted as a bookmark achieve a desirable result of extraction. The controller 30 also displays the format 39 containing the respective features of the extracted character strings in the form of data on the display device 20. When determining that the extraction is not made as desired after checking the extracted character strings, the user performs an operation to correct the format 39 displayed on the display device 20 with the mouse 21, the operation keys 16 and others. The controller 30 accepts the operation, and corrects the format 39 accordingly. Then, the controller 30 extracts character strings again from the document data to conform to the corrected format 39.

The storage device 36 contains, in addition to the above-described program 37, a storage region 38 which corresponds to the above-described BOX, and which is set for example for each of a plurality of users. Document data 40 is stored in each of the storage regions 38. The document data 40 stored in the storage regions 38 may be generated by reading of the images of originals by the scanner unit 13, or may be acquired from a source such as the computer 3 through the network 9.

As shown in FIG. 2, the storage device 36 stores a format 39 and a format management table 39a therein. The format 39 contains, in the form of data, the respective features of character strings extracted from document data as already discussed, and is stored in the storage device 36 when a user orders registration of the format 39 with the image processing device 1, for example. The format 39 is described in more detail later. The storage device 36 can store multiple formats 39 therein. The format management table 39a is table data that manages the formats 39 stored in the storage device 36. The format management table 39a records information indicating a correspondence relation for each of the formats 39. By way of example, the correspondence relation between the format 39 and information of a user who ordered registration of the format 39, or information of the document data attribute (such as file name), is recorded.

FIG. 3 shows functional blocks in the controller 30 when the image processing device 1 functions as a document processing device. In this case, the controller 30 functions as a document data acquiring part 41, a document data processing part 42, a display processing part 43, and an output processing part 44, and extracts a bookmark from document data. In the preferred embodiment, a “bookmark” is an aggregate of character strings such as a heading, a title and a subtitle included in a document. When a document consists of a number of parts including a chapter, a section and an article, for example, the respective representative character strings included in the number of parts (such as a chapter title, a section title and an article title) are extracted. The extracted character strings included in the bookmark are ranked in order of appearance in the document.

The document data acquiring part 41 acquires document data as a target of bookmark extraction. As an example, when the scan function of the image processing device 1 is used, the document data acquiring part 41 acquires document data stored in the image memory 35 obtained by reading of the image of an original by the scanner unit 13. When the BOX function of the image processing device 1 is used, the document data acquiring part 41 reads and acquires the document data 40 from a storage region 38 specified by a user. The document data acquiring part 41 has a character recognizing section 41a formed from a device such as an OCR (optical character reader). When document data is image data that is not in text format, for example, the character recognizing section 41a is put into operation to extract all characters contained in the image data.

The document data processing part 42 extracts the bookmark from document data acquired by the document data acquiring part 41. In response to a user's instruction given through the operation keys 16 or the mouse 21, the document data processing part 42 extracts character strings to be the bookmark from the document data. The character string extracting processing may be repeated as many times as needed in response to the user's instruction. Then, in response to the user's instruction, the document data processing part 42 generates bookmark data based on the character strings extracted in the character string extracting processing. The document data processing part 42 thereafter adds the generated bookmark data to the document data, and outputs the document data to the output processing part 44.

The display processing part 43 displays, on the display device 20, a result of character string extracting processing performed in the document data processing part 42 and the corresponding format 39. Also, the output processing part 44 outputs the document data, to which bookmark data created by the document data processing part 42 is added, to the computer 3 through the network interface 34, or saves the document data to the storage region 38 of the storage device 36.

FIG. 4 is a block diagram showing in detail an exemplary structure of the document data processing part 42. The document data processing part 42 functions as a character string extracting part 45, a format creating part 46, a format correcting part 47, a format applying part 48, and a format recording part 49.

The character string extracting part 45 extracts character strings that are candidates for a bookmark (bookmark candidate character strings) based on the document data 40 received from the document data acquiring part 41. The bookmark candidate character strings correspond to a heading, a title, a subtitle and others extracted from all character strings contained in document data, and are not yet confirmed to be the bookmark the user intended. The character string extracting part 45 extracts bookmark candidate character strings in two ways as follows.

The first way uses a predetermined condition for character string extraction defined in advance in the program 37, and extracts character strings that satisfy this condition. This way is disclosed, for example, in Japanese Patent Application Laid-Open No. JP 2008-305088 A. In the first way, the bookmark candidate character strings are extracted from document data under the predetermined condition for character string extraction.

The second way is to extract character strings by using the format 39 described above. In the second way, character strings conforming to features defined in the format 39 are extracted as the bookmark candidate character strings from all character strings contained in the document data 40.
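The second way can be illustrated with a minimal sketch. It assumes, purely for illustration, that the format is encoded as a list of per-level dictionaries holding a position range and a regular expression for the leading numbering; the names `extract_with_format`, `position_range` and `numeric_pattern` are hypothetical and do not appear in the embodiment.

```python
import re

def extract_with_format(lines, fmt):
    """Return (level, text) pairs for lines that conform to the format.

    lines: list of (text, left_offset) tuples (assumed input shape).
    fmt:   list of per-level dicts; index 0 is the first hierarchical level.
    """
    result = []
    for text, left in lines:
        for level, feat in enumerate(fmt, start=1):
            lo, hi = feat["position_range"]
            # A line conforms when it lies in the level's area and its
            # initial portion matches the level's numbering pattern.
            if lo <= left < hi and re.match(feat["numeric_pattern"], text):
                result.append((level, text))
                break
    return result

# Hypothetical two-level format and document lines.
fmt = [
    {"position_range": (0, 10), "numeric_pattern": r"Chapter \d+"},
    {"position_range": (10, 20), "numeric_pattern": r"\d+\.\d+"},
]
lines = [
    ("Chapter 1 Overview", 0),
    ("This chapter explains the device.", 0),
    ("1.1 Scope", 10),
    ("1.1 is discussed later.", 0),
]
bookmarks = extract_with_format(lines, fmt)
```

In this sketch a body-text line that happens to begin with a number is rejected because it is not placed in the area defined for that level, which is one reason several feature types are combined.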

The format creating part 46 is put into operation when the character string extracting part 45 extracts character strings in the first way as described above. More specifically, the format creating part 46 derives the respective features of the extracted character strings, and creates the format 39 containing the derived features in the form of data. The creation processing of the format 39 by the format creating part 46 is described in detail later. When the character string extracting part 45 extracts character strings in the second way, the format 39 is already present. So, in this case, the format creating part 46 does not create another format 39.

FIG. 5 shows the concept of the format 39. As shown in FIG. 5, the format 39 is information indicating the respective features of bookmark candidate character strings extracted by the character string extracting part 45. The format 39 contains common positional condition information, common keyword information, common numerical information, and common attribute information concerning four types of features. The format 39 has several hierarchical levels, in each one of which the common positional condition information, common keyword information, common numerical information, and common attribute information are defined. In FIG. 5, the format 39 is shown to have a three-level hierarchical structure including first, second and third hierarchical levels. However, the hierarchical structure of the format 39 is not necessarily limited to the three-level hierarchical structure. This hierarchical structure corresponds to a hierarchical heading structure of bookmark candidate character strings including headings of first-level, second-level and third-level items (hereafter, first-level headings, second-level headings and third-level headings), and headings of items of other levels contained in a document. As an example, the first hierarchical level corresponds to the first-level headings. In the first hierarchical level, information in reference to common features for the first-level headings is contained as the common positional condition information, common keyword information, common numerical information, and common attribute information, respectively. The second hierarchical level corresponds to the second-level headings. In the second hierarchical level, information in reference to common features for the second-level headings is contained as the common positional condition information, common keyword information, common numerical information, and common attribute information, respectively. The third hierarchical level corresponds to the third-level headings. In the third hierarchical level, information in reference to common features for the third-level headings is contained as the common positional condition information, common keyword information, common numerical information, and common attribute information, respectively.
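The layered structure of the format can be pictured as a small data structure. The following is a minimal sketch under stated assumptions: the class and field names (`LevelFeatures`, `Format`, `position_range` and so on), the use of a regular expression for the numerical information, and the sample values are all hypothetical and are chosen only to mirror the four types of information defined for each hierarchical level.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

@dataclass
class LevelFeatures:
    """Common features shared by the headings of one hierarchical level."""
    # Common positional condition: (min, max) distance from the reference
    # position; the unit is an assumption of this sketch.
    position_range: Optional[Tuple[float, float]] = None
    # Common keyword, e.g. "chapter".
    keyword: Optional[str] = None
    # Common numerical information, encoded here as a regular expression.
    numeric_pattern: Optional[str] = None
    # Common attributes such as font, text size, or decoration.
    attributes: Dict[str, object] = field(default_factory=dict)

@dataclass
class Format:
    """A format: one LevelFeatures entry per hierarchical level."""
    levels: List[LevelFeatures] = field(default_factory=list)

# Hypothetical three-level format corresponding to the concept of FIG. 5.
fmt = Format(levels=[
    LevelFeatures((0.0, 10.0), "chapter", r"^Chapter \d+", {"bold": True}),
    LevelFeatures((10.0, 20.0), None, r"^\d+\.\d+"),
    LevelFeatures((20.0, 30.0), None, r"^\d+\.\d+\.\d+"),
])
```

Keeping every field optional reflects that a given level need not have a common value for all four types of features.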

The common positional condition information contains range information concerning bookmark candidate character strings in each hierarchical level. The range information indicates a range of distances, from a reference position in the document (the left end of a sheet in the case of a horizontally written document), within which the bookmark candidate character strings are placed, thereby specifying an area in the document in which these bookmark candidate character strings are placed. In the case of the first-level headings, for example, information for specifying an area in which the bookmark candidate character strings to be the first-level headings are placed is stored as the common positional condition information of the first hierarchical level. When a different indentation is set for each of the first-level, second-level and third-level items, for example, their respective indentations are incorporated into the common positional condition information.

The common keyword information indicates a character or a character string shared in common among bookmark candidate character strings in each hierarchical level. When the first-level headings contain character strings such as “Chapter 1” and “Chapter 2,” for example, “chapter” is the common keyword information of the first hierarchical level.
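One way to picture how a common keyword could be found is to intersect the words of all headings in a level. This is a minimal sketch, not the method of the embodiment; the function name `common_keywords` and the choice to compare lowercase alphabetic words are assumptions.

```python
import re

def common_keywords(headings):
    """Return the lowercase alphabetic words that occur in every heading."""
    word_sets = [set(re.findall(r"[a-z]+", h.lower())) for h in headings]
    if not word_sets:
        return set()
    return set.intersection(*word_sets)

# Hypothetical first-level headings.
level1 = ["Chapter 1 Introduction", "Chapter 2 Background"]
shared = common_keywords(level1)
```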

The common numerical information indicates the location of a number that is contained in bookmark candidate character strings in each hierarchical level, and that is shared in common among these bookmark candidate character strings. It is assumed, for example, that the first-level headings are “Chapter 1 . . . (name of main theme) . . . ,” “Chapter 2 (name of main theme) . . . ,” and others. In this case, the initial portion of each character string contains a numerical value indicating the number of a corresponding chapter. So, in the common numerical information of the first hierarchical level, it is defined that the initial portion of each character string includes numerical information for stating a corresponding first-level item. It is also assumed that the second-level headings are “1.1. . . . (name of subordinate theme) . . . ,” “1.2. . . . (name of subordinate theme) . . . ,” and others, for example. In this case, the first to third initial letters (including a dot as the second letter) in each character string correspond to a numerical value stating the number of each second-level item. So, in the common numerical information of the second hierarchical level, it is defined that the first to third initial letters in each character string contain numerical information for stating a corresponding second-level item. The same applies to the common numerical information of the third hierarchical level and of lower levels. The extracted character strings may be arranged in a hierarchical structure based on the common numerical information.
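As a rough illustration of arranging character strings in a hierarchy from their leading numbers, the sketch below infers a level from the numbering in the initial portion of a heading. The regular expressions and the function name `heading_level` are assumptions of this sketch; the embodiment does not specify how the numerical information is encoded.

```python
import re

# "Chapter N ..." is treated as a first-level heading (assumption).
CHAPTER = re.compile(r"^Chapter\s+(\d+)")
# Dotted numbering such as "1.1." or "1.2.3" followed by whitespace.
DOTTED = re.compile(r"^(\d+(?:\.\d+)+)\.?\s")

def heading_level(text):
    """Return the hierarchical level implied by the leading number, or None."""
    if CHAPTER.match(text):
        return 1
    m = DOTTED.match(text)
    if m:
        # "1.1" has one dot and is level 2; "1.2.3" has two and is level 3.
        return m.group(1).count(".") + 1
    return None

levels = [heading_level(t) for t in
          ["Chapter 2 Method", "1.1. Scope", "1.2.3 Details", "Plain text"]]
```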

The common attribute information defines attribute information shared in common among bookmark candidate character strings in each hierarchical level. By way of example, when a font, a text size, or decoration such as underlining and italicizing is commonly used among the bookmark candidate character strings in each hierarchical level, the common attribute is stored as the common attribute information.

When the character string extracting part 45 extracts character strings in the first way, the format creating part 46 arranges the character strings extracted as bookmark candidates in a hierarchical structure based on their respective features. Then, the format creating part 46 generates the common positional condition information, common keyword information, common numerical information, and common attribute information as information for indicating the features shared in common in each hierarchical level, thereby creating the format 39.

It is assumed, for example, that the indentation differs among first-level, second-level and third-level items as described above. In this case, the areas in which the character strings extracted as bookmark candidates are placed differ according to the respective indentations. So, the format creating part 46 derives positional information, indicating where each bookmark candidate character string is placed, as a feature relating to the position of the bookmark candidate character string. When the same positional information is derived from several bookmark candidate character strings, these character strings are categorized into one group. So, several bookmark candidate character strings having a common feature relating to a position are categorized into the same group. A bookmark candidate character string that cannot be categorized into any group is eliminated from consideration as a bookmark, unless it has another feature that is shared with a different bookmark candidate character string.
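The grouping by position can be sketched as follows. This is an illustrative simplification: it assumes each candidate carries a single left offset that is already quantized so that equal indentations compare equal, and the function name `group_by_position` is hypothetical.

```python
from collections import defaultdict

def group_by_position(candidates):
    """Group candidates that share the same left offset.

    candidates: list of (text, left_offset) tuples (assumed input shape).
    Returns (groups, ungrouped): groups maps an offset to the texts placed
    there when two or more share it; ungrouped lists the remaining texts.
    """
    by_offset = defaultdict(list)
    for text, left in candidates:
        by_offset[left].append(text)
    groups = {k: v for k, v in by_offset.items() if len(v) >= 2}
    ungrouped = [v[0] for v in by_offset.values() if len(v) == 1]
    return groups, ungrouped

# Hypothetical candidates with offsets in arbitrary units.
candidates = [
    ("Chapter 1 Overview", 0),
    ("1.1 Scope", 10),
    ("Chapter 2 Method", 0),
    ("1.2 Terms", 10),
    ("Figure note", 37),
]
groups, ungrouped = group_by_position(candidates)
```

An ungrouped candidate is not discarded at this point; it is dropped only if it also fails to share a keyword, numerical, or attribute feature with another candidate.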

The format creating part 46 also derives keyword information from each bookmark candidate character string as a feature relating to a keyword contained in each bookmark candidate character string. When the bookmark candidate character strings contain keywords such as “chapter,” “section” and “article,” for example, these keywords are derived as keyword information. When the same keyword information is derived from several bookmark candidate character strings, these character strings are categorized into one group. So, several bookmark candidate character strings having a common feature relating to a keyword are categorized into the same group. A bookmark candidate character string that cannot be categorized into any group is eliminated from consideration as a bookmark, unless it has another feature that is shared with a different bookmark candidate character string.

The format creating part 46 also derives numerical information from each bookmark candidate character string as a feature relating to a numerical value contained in each bookmark candidate character string. As an example, when the initial portion of each bookmark candidate character string includes a numerical value, the numerical value in the initial portion is derived as numerical information. When the same numerical information is derived from several bookmark candidate character strings, these character strings are categorized into one group. So, several bookmark candidate character strings having a common feature relating to a numerical value are categorized into the same group. A bookmark candidate character string that cannot be categorized into any group is eliminated from consideration as a bookmark, unless it has another feature that is shared with a different bookmark candidate character string.

The format creating part 46 further derives attribute information from each bookmark candidate character string as a feature relating to the attribute of each bookmark candidate character string. As an example, a setting made for each bookmark candidate character string such as a font, a text size, or decoration such as underlining and italicizing is derived as attribute information. When the same attribute information is derived from several bookmark candidate character strings, these character strings are categorized into one group. So, several bookmark candidate character strings having a common feature relating to an attribute are categorized into the same group. A bookmark candidate character string that cannot be categorized into any group is eliminated from consideration as a bookmark, unless it has another feature that is shared with a different bookmark candidate character string.

As described, the format creating part 46 derives four types of features from each bookmark candidate character string, and categorizes the bookmark candidate character strings into groups with respect to each of the four types of features. So, four sets of groups are obtained from the four types of features.

The format creating part 46 selects at least one set from the four sets of groups, and arranges one or more groups contained in the selected set in a hierarchical structure. As an example, one or more groups obtained by categorizing bookmark candidate character strings having a common feature relating to a position are arranged in a hierarchical structure corresponding, for example, to the respective indentations of the bookmark candidate character strings. Then, the format creating part 46 associates the bookmark candidate character strings with the hierarchical structure. As a result, the bookmark candidate character strings extracted from the document data 40 are arranged in a hierarchical structure. The format creating part 46 also associates one or more groups included in the other sets of groups with the hierarchical structure.

Then, based on the information of each group associated with the hierarchical structure, the format creating part 46 generates information indicating that bookmark candidate character strings in each hierarchical level have a common feature. More specifically, the format creating part 46 generates the above-described common positional condition information, common keyword information, common numerical information, and common attribute information for each hierarchical level, thereby creating the format 39.

By the processing described above, each of the common positional condition information, common keyword information, common numerical information, and common attribute information has a structure corresponding to the hierarchical structure of bookmark candidate character strings. As a result, the format 39 such as the one shown in FIG. 5 is created.

So, when the character string extracting part 45 is put into operation to extract bookmark candidate character strings from the document data 40, for example, in the first way, and the format creating part 46 creates the format 39 based on the extracted bookmark candidate character strings, the document data processing part 42 outputs the bookmark candidate character strings together with the created format 39 to the display processing part 43. This causes the bookmark candidate character strings and the corresponding format 39 to be displayed on the display device 20, by which a user is allowed to see whether or not a desirable result of bookmark extraction is achieved.

Returning to FIG. 4, the format correcting part 47 corrects the format 39 based on the user's instructions. The format correcting part 47 corrects the common positional condition information, common keyword information, common numerical information, and common attribute information in each hierarchical level shown in FIG. 5 in response to an operation for correction made by the user.

The format applying part 48 applies the format 39 corrected by the format correcting part 47 to the character string extracting part 45, thereby causing the character string extracting part 45 to extract character strings again. For the second extraction, the character string extracting part 45 uses the second way to extract character strings. More specifically, the character string extracting part 45 extracts character strings conforming to features defined in the format 39 corrected by the format correcting part 47, so that bookmark candidate character strings are extracted again from the document data 40. Then, based on a result of the second extraction, a display screen on the display device 20 is updated. As already described, the format creating part 46 does not create another format 39 after character strings are extracted in the second way.

The format recording part 49 stores the format 39 created by the format creating part 46, or the format 39 corrected by the format correcting part 47, into the storage device 36. It is assumed, for example, that the user sees a display screen on the display device 20 to determine that a desirable result of bookmark extraction is achieved, and then gives instructions to output the document with the addition of the extracted bookmark. In this case, the format recording part 49 stores the format 39 as it is at the time of receiving the instructions into the storage device 36.

At the same time, the format recording part 49 enters information into the format management table 39 a. This information associates the format 39 to be stored in the storage device 36 with the logged-in user who has made the operation, or with the attribute of the document data 40. So, when the same user as the logged-in user logs into the image processing device 1 at a later time, the format 39 used in the past by this user can be read automatically, for example. When a bookmark is to be extracted from document data that has the same attribute as that of the document data from which a bookmark was extracted in the past, the format 39 used in the past can be read automatically. This enhances the convenience of the image processing device 1. Examples of the attribute of the document data 40 include a character string contained in the file name of the document data 40.

Examples of screens displayed on the display device 20 in the processing of bookmark extraction are described next. FIGS. 6 to 10 each show an example of a screen relating to bookmark extraction displayed on the display device 20. As shown in each of FIGS. 6 to 10, the screen relating to bookmark extraction includes a bookmark format display field 51 in which the above-described detail of the format 39 is displayed, a bookmark preview field 52 in which character strings extracted as bookmark candidates are displayed in a hierarchical structure, and a document preview field 53 in which the content of the document data 40 as a target of bookmark extraction is previewed.

FIG. 6 shows a screen in a state where the document data 40 as a target of bookmark extraction is entered into the document data processing part 42. In this state, character strings to become bookmark candidates are not yet extracted. So, on the screen shown in FIG. 6, the content of the document data 40 is previewed in the document preview field 53. The document preview field 53 in FIG. 6 shows the preview of a document including first-level, second-level and third-level items. After character strings to become bookmark candidates are extracted in the first way, and a corresponding format 39 is created, the screen on the display device 20 is changed to the one shown in FIG. 7.

After the bookmark extraction, the detail of the format 39 corresponding to the character strings extracted as bookmark candidates is displayed in the bookmark format display field 51 as shown in FIG. 7. In FIG. 7, character strings corresponding to items of three hierarchical levels including the first-level, second-level and third-level items are displayed in “BOOKMARK (CHARACTER STRING)” columns. On the right side of the “BOOKMARK (CHARACTER STRING)” columns, common positional conditions, common keywords, common numerical information, and common attribute information, each shared as common features among character strings in each hierarchical level, are associated with the hierarchical structure and displayed. Further, the character strings extracted as bookmark candidates are displayed in the hierarchical structure in the bookmark preview field 52. The character strings extracted as bookmark candidates are displayed with decoration, for example, with rectangular boxes in the document preview field 53, so that the parts extracted as bookmark candidates can easily be found.

Character strings extracted as bookmark candidates displayed in the bookmark format display field 51 and the bookmark preview field 52 are linked to the document data 40. So, a character string displayed in each of the bookmark format display field 51 and the bookmark preview field 52, and a character string contained in a preview image displayed in the document preview field 53 are shown as being linked to each other. As an example, when a user selects one of multiple extracted character strings, an image region corresponding to the selected character string appears in a predetermined display position (such as the upper left corner) in the document preview field 53.

Correction of the format 39 is described next. It is assumed, for example, that the user checks character strings extracted as candidates for a bookmark displayed on the screen shown in FIG. 7, and determines that a desirable result of bookmark extraction is not achieved. In this case, the user corrects each piece of information displayed in the bookmark format display field 51 with the mouse 21 and the like, so that the user can correct the character strings as the bookmark.

FIG. 8 shows a screen in which character strings in the third hierarchical level are eliminated from the candidates for the bookmark after the user's operation for correction is made to the display screen shown in FIG. 7. As an example, the user operates the mouse 21 to move a mouse pointer 54 to a check box of the third hierarchical level displayed in the left end of the bookmark format display field 51 as shown in FIG. 8. Then, the user clicks the check box of the third hierarchical level to eliminate all the extracted character strings in the third hierarchical level from the candidates for the bookmark, so that the screen is changed from the one of FIG. 7 to the one of FIG. 8. That is, this operation deletes all the information in the third hierarchical level contained in the format 39, so that the format 39 is corrected to be formed from first and second hierarchical levels. Then, character strings are extracted again as candidates for the bookmark based on the corrected format 39. The screen of FIG. 8 shows a result of the second extraction.

As shown in FIG. 8, the character strings extracted again as candidates for the bookmark are shown in a hierarchical structure in the bookmark preview field 52. More specifically, in FIG. 8, the character strings associated with the third hierarchical level are deleted. The character strings extracted as candidates for the bookmark are displayed with decoration with rectangular boxes in the document preview field 53. Compared to the character strings seen in FIG. 7, those in FIG. 8 corresponding to the third-level items (in the third hierarchical level) are no longer decorated with rectangular boxes.

FIG. 9 shows a screen in which the common keyword of the third hierarchical level is eliminated from the features shared in common in the third hierarchical level after the user's operation for correction is made to the display screen shown in FIG. 7. As an example, the user operates the mouse 21 to move the mouse pointer 54 to a check box corresponding to a common keyword of the third hierarchical level as shown in FIG. 9. Then, the user clicks the check box to eliminate the common keyword of the third hierarchical level from the features shared in common in the third hierarchical level, so that the screen is changed from the one of FIG. 7 to the one of FIG. 9. In FIG. 7, a word “Sub-sub” is a feature shared in common in the third hierarchical level as the common keyword of the third hierarchical level. The setting of the common keyword of the third hierarchical level is canceled as a result of the above-described operation. More specifically, according to this operation, the common keyword of the third hierarchical level is deleted from the format 39, and the format 39 is corrected. Then, character strings are extracted again as candidates for a bookmark based on the corrected format 39. The screen of FIG. 9 shows a result of the second extraction.

As shown in FIG. 9, another third-level item “1.1.4. Small theme” that does not include the word “Sub-sub” is newly added as a bookmark candidate to the bookmark preview field 52. Further, the character string of the third-level item added as the candidate for the bookmark is displayed with decoration with a rectangular box in the document preview field 53.
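The effect of canceling a common keyword and extracting again can be illustrated with a short Python sketch. This is only an illustration under stated assumptions, not the device's implementation: the `LevelFormat` class, the `conforms` and `extract` functions, and the reduction of one hierarchical level to a numbering depth plus an optional keyword are all hypothetical simplifications, and every sample string other than “1.1.4. Small theme” is invented.

```python
import re
from dataclasses import dataclass, replace
from typing import Optional

# Leading numbering such as "1", "1.1" or "1.1.4", optionally followed by a dot.
LEADING_NUMBER = re.compile(r"^\s*(\d+(?:\.\d+)*)\.?")


@dataclass(frozen=True)
class LevelFormat:
    """Hypothetical, simplified features of one hierarchical level."""
    numbering_depth: int           # dots between numbers, e.g. 2 for "1.1.4"
    keyword: Optional[str] = None  # None means no keyword condition is set


def conforms(text, level):
    """Return True when the string satisfies every condition set for the level."""
    match = LEADING_NUMBER.match(text)
    if match is None or match.group(1).count(".") != level.numbering_depth:
        return False
    if level.keyword is not None and level.keyword not in text:
        return False
    return True


def extract(strings, level):
    """Extract the strings that conform to the given level format."""
    return [s for s in strings if conforms(s, level)]


# Hypothetical document strings; only "1.1.4. Small theme" comes from the text.
strings = [
    "1.1. Sub-theme",
    "1.1.1. Sub-sub theme A",
    "1.1.2. Sub-sub theme B",
    "1.1.4. Small theme",
]
third_level = LevelFormat(numbering_depth=2, keyword="Sub-sub")
before = extract(strings, third_level)
# Correcting the format: cancel the common keyword, then extract again.
after = extract(strings, replace(third_level, keyword=None))
```

With the keyword condition in place, only the two strings containing “Sub-sub” are extracted. Once the keyword is canceled, “1.1.4. Small theme” also conforms to the third level, which corresponds to the change from FIG. 7 to FIG. 9.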

FIG. 10 shows a screen in which a common positional condition of the second hierarchical level is corrected after the user's operation for correction is made to the display screen shown in FIG. 7. As an example, the user operates the mouse 21 to move the mouse pointer 54 to a rectangular box corresponding to the second hierarchical level (second-level item) in the document preview field 53 as shown in FIG. 10. Then, the user drags the right edge of the rectangular box to a different position to correct the common positional condition of the second hierarchical level. This is not the only way of correcting the common positional condition. As an example, a value indicating a range from a reference position (a value of a region display field 51 a) in a document displayed in the bookmark format display field 51 may be entered through a numeric keypad to correct the common positional condition. For the common positional condition of the second hierarchical level shown in FIG. 10, the positions of the right edges of rectangular boxes corresponding to the second-level items are shifted to the left as compared to the corresponding positions shown in FIG. 7. So, the common positional condition is corrected to define smaller areas in which character strings corresponding to the second-level items are arranged. The operation described above corrects the common positional condition of the second hierarchical level in the format 39, thereby correcting the format 39. Then, character strings are extracted again as candidates for the bookmark based on the corrected format 39. The screen of FIG. 10 shows a result of the second extraction.

As shown in FIG. 10, a second-level item “1.3. Sub-theme ABCDEF . . . ” that was at first one of the candidates for the bookmark of the second hierarchical level is deleted from the bookmark preview field 52. The second-level item “1.3. Sub-theme ABCDEF . . . ” is also shown to be eliminated from consideration as the bookmark in the document preview field 53.

As described, in the preferred embodiment, the user makes an operation for correcting the format 39 displayed on the display device 20 when character strings extracted as candidates for the bookmark do not achieve a result of extraction desired by the user. Then, the image processing device 1 extracts character strings again based on the corrected format 39. The user's operation for correction is to correct a feature shared in common among character strings extracted as candidates for the bookmark (such as the common positional condition information, the common keyword information, the common numerical information, or the common attribute information). So, the format 39 is efficiently corrected with a simple operation.

The operation of the image processing device 1 is described next. FIGS. 11 to 14 are flow diagrams each explaining an exemplary operation of the image processing device 1. The controller 30 of the image processing device 1 is mainly responsible for the processes to be performed based on these flow diagrams. FIGS. 15 to 17, 18A and 18B, and 19A and 19B each show an exemplary display screen displayed on the display unit 15 of the operational panel 17 of the image processing device 1.

As shown in FIG. 11, the image processing device 1 in its initial state waits for a user to log in (step S10). FIG. 15 shows a display screen displayed in the waiting state on the display unit 15 of the operational panel 17. As shown in FIG. 15, an ID entry field 61 and a password entry field 62 are displayed on the display unit 15 in the waiting state for login. A user operates, for example, the operation keys 16 of the operational panel 17 to enter necessary information into these entry fields, and then operates an OK key 63. In response, the image processing device 1 performs user authentication. The image processing device 1 identifies the user when the user is successfully authenticated, by which the image processing device 1 is put into a state in which the user is logged into the image processing device 1 (a result of step S10 is YES). A cancel key 64 shown in FIG. 15 is provided to cancel information entered into the ID entry field 61 or the password entry field 62.

After the image processing device 1 is put into the state in which the user is logged into the image processing device 1 (when a result of step S10 is YES), a function that is set in advance for the user is made available for the user. Then, the image processing device 1 receives a user's operation on the operational panel 17 (step S11).

When the user selects the scan function to order a scan of an original (when a result of step S12 is YES), the image processing device 1 puts the scanner part 13 into operation to read the original (step S13). The image processing device 1 thereafter determines whether or not the user has ordered bookmark extraction (step S14). When the bookmark extraction is ordered (when a result of step S14 is YES), the image processing device 1 performs bookmark processing (step S15). In this bookmark processing, bookmark data is added to document data 40. In contrast, the image processing device 1 does not perform the bookmark processing when bookmark extraction is not ordered (when a result of step S14 is NO). The image processing device 1 thereafter outputs the document data 40 read from the original, and completes the procedure (step S16). The details of the bookmark processing are described later.

FIG. 16 shows an example of a display screen that is displayed when the user selects the scan function. The screen shown in FIG. 16 is displayed, for example, when the user selects a scan key 65 and then a file key 67. This screen allows the user to select, from PDF, JPEG and TIFF, a file format in which image data acquired from the original is output. In FIG. 16, a PDF key 68 is shown to be selected. Then, the user operates an OK key 69, by which the file format of the image data for output is fixed. In the preferred embodiment, a bookmark extraction key 70 becomes operable, for example, when the user selects the PDF key 68. The user thereafter operates the bookmark extraction key 70, by which the image processing device 1 is instructed to extract a bookmark.

Returning to FIG. 11, when the user selects the BOX function to order a BOX operation after logging in (when a result of step S12 is NO and when a result of step S17 is YES), the image processing device 1 reads document data 40 from a BOX (storage region 38) designated by the user (step S18). Then, the image processing device 1 determines whether or not the user has ordered bookmark extraction (step S19). When bookmark extraction has been ordered (when a result of step S19 is YES), the image processing device 1 performs the bookmark processing (step S20). In contrast, the image processing device 1 does not perform bookmark processing when bookmark extraction has not been ordered (when a result of step S19 is NO). The image processing device 1 thereafter outputs the document data 40 read from the BOX (storage region 38), and completes the procedure (step S21). This bookmark processing (step S20) is the same as that performed in step S15, and is described in detail later.

FIG. 17 shows an example of a display screen that is displayed when the user selects the BOX function. The screen shown in FIG. 17 is displayed, for example, when the user selects a BOX key 66 and then the file key 67. This screen shows, in a list form, the document data stored in the BOX designated by the user, and allows the user to select desirable document data 40. In FIG. 17, document data with a file name “Report07.pdf” is shown to be selected. Then, the user operates an OK key 74, by which document data to be read from the BOX is fixed. In the preferred embodiment, the bookmark extraction key 70 becomes operable, for example, when the user selects a PDF file. The user thereafter operates the bookmark extraction key 70, by which the image processing device 1 is instructed to extract a bookmark.

When the user orders processing other than the above-described processing (when results of steps S12 and S17 are both NO), the image processing device 1 performs the processing as ordered (such as the copy function) (step S22), and then completes the procedure.

FIG. 12 is a flow diagram explaining in detail an exemplary procedure of the bookmark processing (steps S15 and S20). After starting the bookmark processing (steps S15 and S20), the image processing device 1 acquires document data as a target of bookmark extraction (step S100). As an example, the image processing device 1 acquires document data 40 from the image memory 35 when a bookmark is to be extracted after an original is scanned. As another example, the image processing device 1 acquires document data 40 from a storage region 38 designated by the user when a bookmark is to be extracted with a BOX operation.

Next, the image processing device 1 determines whether or not the acquired document data 40 requires character recognizing processing (step S101). Here, this determination is made based on whether or not the document data 40 is in text format with characters contained in the document data 40 represented by character codes. The image processing device 1 determines that the character recognizing processing should be performed when the document data 40 is not in text format. When determining that the character recognizing processing should be performed (when a result of step S101 is YES), the image processing device 1 performs the character recognizing processing to represent each of the characters in the document data 40 by a character code, thereby converting the document data 40 to data in text format (step S102). In contrast, the image processing device 1 does not perform the character recognizing processing when determining that the character recognizing processing is not required (when a result of step S101 is NO).

The image processing device 1 thereafter cuts all character strings out of a document of the document data 40 based on the document data 40 in text format (step S103). So, all the character strings contained in the document data 40 are extracted. Further, the size, the position and other properties of each character string in the document are specified.

Next, the image processing device 1 determines based on the format management table 39 a whether the storage device 36 stores therein a format 39 associated with the logged-in user, or a format 39 associated with the attribute of the document data 40 (step S104). When the format management table 39 a stores therein the format 39 associated with the logged-in user, or the format 39 associated with the attribute of the document data 40 (when a result of step S104 is YES), the image processing device 1 reads the associated format 39 from the storage device 36, and sets the read format 39 as a format to be applied (step S105). In contrast, the image processing device 1 does not set any format 39 as a format to be applied when the format management table 39 a does not store an associated format 39 therein. Then, the image processing device 1 displays a setting screen relating to bookmark extraction on the display unit 15 of the operational panel 17 to receive a user's operation for format setting (step S106).
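The lookup in steps S104 and S105 can be pictured with the following Python sketch. The table layout (a list of dictionaries with the keys `format`, `user` and `file_name_contains`), the function name `select_default_format`, and the rule that a user association is checked before a file-name association are all assumptions for illustration; the text does not specify how the format management table 39 a is organized or which association takes priority.

```python
def select_default_format(table, user, file_name):
    """Return the name of a stored format to apply by default, or None.

    None stands for the case in which no format is set as the format to be
    applied, so that extraction in the first way is used.
    """
    # Assumption: an association with the logged-in user is checked first.
    for entry in table:
        if "user" in entry and entry["user"] == user:
            return entry["format"]
    # Then an association with a document attribute, here a character
    # string contained in the file name.
    for entry in table:
        fragment = entry.get("file_name_contains")
        if fragment and fragment in file_name:
            return entry["format"]
    return None


# Hypothetical table contents modeled on the examples in the text.
table = [
    {"format": "Format_A", "user": "A"},
    {"format": "Format_Report", "file_name_contains": "Report"},
]
by_user = select_default_format(table, "A", "Scan01.pdf")
by_file_name = select_default_format(table, "B", "Report07.pdf")
no_match = select_default_format(table, "B", "Memo.pdf")
```

Here `by_user` is "Format_A", `by_file_name` is "Format_Report", and `no_match` is None.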

FIGS. 18A and 18B each show an exemplary setting screen displayed when the user has ordered bookmark extraction after selecting the scan function. As shown in FIGS. 18A and 18B, each of these setting screens includes a field 71 for designating a way of bookmark extraction, and a format setting field 72. In the setting screen shown in FIG. 18A, “AUTOMATIC” is selected as a way of bookmark extraction. This “AUTOMATIC” means that character strings are to be extracted as candidates for a bookmark in the above-described first way. When the format 39 associated with the logged-in user “A” is not stored in the storage device 36 of the image processing device 1, for example, “AUTOMATIC” is set as a default, and is displayed on the display unit 15. Then, the user orders extraction of character strings in the first way by operating an OK key 73.

In the setting screen shown in FIG. 18B, bookmark extraction employing the format 39 that corresponds to the above-described second way is selected as a way of bookmark extraction. In the setting screen of FIG. 18B, “Format_A” associated with the user “A” is set as a format to be applied. When “Format_A” associated with the logged-in user “A” is stored as one of multiple formats 39 stored in the storage device 36 of the image processing device 1, for example, the image processing device 1 selects “FORMAT” as a default as a way of bookmark extraction, and further selects “Format_A” as a default as a format to be applied. Then, the user orders extraction of character strings by applying the format 39 corresponding to “Format_A” by operating an OK key 73. When the user wishes to change the format to be applied at character string extraction, the user may click a pull-down button 72 a provided in the right edge of the format setting field 72. Then, all formats 39 stored in the storage device 36 appear in a pull-down list, so the user can select a desirable format 39 therefrom. When the user wishes to change a way of bookmark extraction to “AUTOMATIC,” the user may click a pull-down button 71 a provided in the right edge of the field 71 for designating a way of bookmark extraction. Then, “AUTOMATIC” appears as an alternative in a pull-down list, so the user can change the way of bookmark extraction by selecting the alternative.

FIGS. 19A and 19B each show an exemplary setting screen displayed when the user has ordered bookmark extraction after selecting the BOX function. Like those that are displayed after the scan function is selected, each of the setting screens shown in FIGS. 19A and 19B includes the field 71 for designating a way of bookmark extraction and the format setting field 72. In the setting screen shown in FIG. 19A, “AUTOMATIC” is selected as a way of bookmark extraction. When a format 39 associated with the logged-in user “A” is not stored in the storage device 36 of the image processing device 1, and when a format 39 associated with the file name “Report07.pdf” as one of attributes of document data is not stored in the storage device 36 of the image processing device 1, for example, “AUTOMATIC” is set as a default. Then, the user orders extraction of character strings in the first way by operating an OK key 73.

In the setting screen shown in FIG. 19B, bookmark extraction employing a format 39 that corresponds to the above-described second way is selected as a way of bookmark extraction. In the setting screen of FIG. 19B, “Format_Report” associated with the file name “Report07.pdf” that is one of attributes of document data is set as a format 39 to be applied. When the file name of the document data 40 selected as a target of bookmark extraction contains “Report”, for example, “Format_Report” is selected as a default as a format 39 associated with a file name containing “Report” from multiple formats 39 stored in the storage device 36 of the image processing device 1. Then, the user orders extraction of character strings by applying the format 39 corresponding to “Format_Report” by operating an OK key 73. When the user wishes to change the format to be applied, or when the user wishes to change the way of bookmark extraction to “AUTOMATIC,” the user can make a desirable change by making the same operation as that described above on the screen of FIG. 19B.

After the user specifies the way of character string extraction, the image processing device 1 proceeds to character string extracting processing (step S107). FIG. 13 is a flow diagram explaining in detail an exemplary procedure of the character string extracting processing. The image processing device 1 determines which way should be used for bookmark extraction by determining whether or not a format to be applied is set (step S200). When no format 39 is set as the format to be applied (when a result of step S200 is NO), the flow goes to step S201. In contrast, the flow goes to step S221 when a format 39 is set as the format to be applied (when a result of step S200 is YES).

Steps S201 to S209 correspond to the character string extracting processing in the first way. First, the character string extracting processing in the first way is described. The image processing device 1 extracts, from the acquired document data 40, all character strings that satisfy a predetermined condition for character string extraction as candidates for a bookmark (step S201). Here, character strings that satisfy a condition for character string extraction defined in advance in the program 37 are all extracted.

Next, based on the respective positions of the bookmark candidate character strings extracted in step S201, the image processing device 1 categorizes the bookmark candidate character strings into groups (step S202). Here, the respective positions at which the extracted bookmark candidate character strings exist in the document are specified. When there is commonality among respective positions of two or more character strings, these character strings are categorized into one group.

The image processing device 1 thereafter derives respective portions as keywords from the bookmark candidate character strings extracted in step S201, and categorizes the bookmark candidate character strings into groups based on the derived keywords (step S203). As an example, keywords such as “chapter” and “section” defined in advance may be derived. As another example, respective portions each including at least one letter may automatically be derived from the bookmark candidate character strings. In this case, the suitability of each derived portion as a keyword is evaluated, and a portion whose evaluated suitability is not less than a certain level is set as a keyword. After respective keywords are derived from the bookmark candidate character strings, the bookmark candidate character strings are categorized into groups for each common keyword.

The image processing device 1 thereafter derives respective numerical information from the bookmark candidate character strings extracted in step S201, and categorizes the bookmark candidate character strings into groups based on the derived numerical information (step S204). As an example, numerical information is derived from several initial letters as a target of extraction of numerical information in each of the bookmark candidate character strings. Then, when the derived numerical information includes marks such as dots between numbers, the bookmark candidate character strings are categorized into groups each having the same number of dots.
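Grouping by the number of dots in the leading numbering can be sketched in Python as follows. The regular expression, the function names `numbering_depth` and `group_by_numbering`, and the choice to count only the dots that sit between numbers (so that a trailing dot is ignored) are assumptions for illustration rather than the method of step S204; the sample strings other than “1.1.4. Small theme” are hypothetical.

```python
import re
from collections import defaultdict

# Leading numbering such as "1", "1.1" or "1.1.4", optionally followed by a dot.
LEADING_NUMBER = re.compile(r"^\s*(\d+(?:\.\d+)*)\.?")


def numbering_depth(text):
    """Return the number of dots between numbers at the start of the string.

    Returns None when the string does not start with a number.
    """
    match = LEADING_NUMBER.match(text)
    if match is None:
        return None
    return match.group(1).count(".")


def group_by_numbering(candidates):
    """Group candidate strings that share the same number of dots."""
    groups = defaultdict(list)
    for text in candidates:
        depth = numbering_depth(text)
        if depth is not None:
            groups[depth].append(text)
    return dict(groups)


numbering_groups = group_by_numbering(
    ["1. Theme", "1.1. Sub-theme", "1.2. Sub-theme", "1.1.4. Small theme", "Summary"]
)
```

In this sketch “1. Theme” has depth 0, the two “Sub-theme” strings share depth 1, “1.1.4. Small theme” has depth 2, and “Summary” carries no numerical information and so joins no group.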

The image processing device 1 thereafter categorizes the bookmark candidate character strings extracted in step S201 into groups based on the respective attributes of the bookmark candidate character strings (step S205). Here, the respective attributes of the bookmark candidate character strings such as fonts are determined, and the bookmark candidate character strings are categorized into groups each having a common attribute.

As a result of the above-described processing in steps S202 to S205, one set of groups for each of the four types of features is obtained from the bookmark candidate character strings extracted in step S201. The image processing device 1 arranges the bookmark candidate character strings in a hierarchical structure based on at least one of the four types of features for which the groups of the bookmark candidate character strings are obtained (step S206). As an example, the indentation differs among the groups into which the bookmark candidate character strings are categorized based on their respective features regarding positions. In this case, the groups are arranged in the hierarchical structure according to the respective indentations of the groups. Then, the bookmark candidate character strings are associated with the hierarchical structure, thereby arranging the bookmark candidate character strings into the hierarchical structure.
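Arranging groups in a hierarchy according to indentation can be sketched in Python as follows. The representation of a candidate as a pair of text and left-edge position, the `tolerance` parameter, and the function name `arrange_by_indentation` are assumptions for illustration; the text states only that groups are ordered by their respective indentations, and the sample coordinates are hypothetical.

```python
from bisect import bisect_right


def arrange_by_indentation(candidates, tolerance=2.0):
    """Assign a hierarchical level to each candidate from its left edge.

    candidates: list of (text, left_edge) pairs.
    Returns a list of (level, text) pairs in the input order, where level 1
    is the least indented group.
    """
    # Cluster the distinct left edges: an edge starts a new level only when
    # it lies more than `tolerance` to the right of the current level start.
    level_starts = []
    for edge in sorted({left for _, left in candidates}):
        if not level_starts or edge - level_starts[-1] > tolerance:
            level_starts.append(edge)
    # The level is the 1-based index of the last level start not exceeding
    # the candidate's left edge.
    return [(bisect_right(level_starts, left), text) for text, left in candidates]


# Hypothetical candidates with left-edge positions in arbitrary units.
hierarchy = arrange_by_indentation([
    ("1. Theme", 10.0),
    ("1.1. Sub-theme", 20.0),
    ("1.1.1. Sub-sub theme", 30.5),
    ("1.2. Sub-theme", 21.0),
])
```

The small offset between 20.0 and 21.0 falls within the tolerance, so both “Sub-theme” strings land in the second level.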

The image processing device 1 thereafter associates the respective features categorized into the groups with the hierarchical structure, thereby arranging two or more groups obtained with respect to each of the four types of features into the hierarchical structure (step S207).

Next, the image processing device 1 creates a format 39 that contains the respective features of the groups arranged in the hierarchical structure in the form of data (step S208). More specifically, the format creating part 46 generates the above-described common positional condition information, common keyword information, common numerical information, and common attribute information for each hierarchical level based on a feature shared in common in each group associated with the hierarchical structure, thereby creating the format 39.
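One possible data representation of the format 39 created in step S208 is sketched below. The class LevelFormat, its field names, and the rule that a feature is recorded only when every member of a level shares the same value are assumptions for illustration; the embodiment does not prescribe a particular data layout.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass
class LevelFormat:
    """Hypothetical per-level record holding the four types of common features."""
    level: int
    indent_range: Tuple[int, int]   # common positional condition (min, max indentation)
    keyword: Optional[str]          # common keyword, if all members share one
    dot_count: Optional[int]        # common numerical information
    font: Optional[str]             # common attribute

def common_value(values):
    """Return the value shared by all members, or None when they differ."""
    unique = set(values)
    return unique.pop() if len(unique) == 1 else None

def create_format(levels):
    """levels: dict mapping level -> list of feature dicts for that level."""
    fmt = []
    for level in sorted(levels):
        members = levels[level]
        indents = [m["indent"] for m in members]
        fmt.append(LevelFormat(
            level=level,
            indent_range=(min(indents), max(indents)),
            keyword=common_value(m["keyword"] for m in members),
            dot_count=common_value(m["dot_count"] for m in members),
            font=common_value(m["font"] for m in members),
        ))
    return fmt

levels = {
    1: [{"indent": 10, "keyword": "Chapter", "dot_count": 0, "font": "Gothic"},
        {"indent": 12, "keyword": "Chapter", "dot_count": 0, "font": "Gothic"}],
    2: [{"indent": 20, "keyword": None, "dot_count": 1, "font": "Gothic"},
        {"indent": 22, "keyword": None, "dot_count": 1, "font": "Mincho"}],
}
fmt = create_format(levels)
```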

Finally, the image processing device 1 links the character strings extracted as bookmark candidates to the document data 40 (step S209). In this linking processing, each character string as a bookmark candidate is given positional information indicating a position where the character string exists in the document data 40.

Steps S221 to S228 correspond to the character string extracting processing in the second way. The character string extracting processing in the second way is described next. First, the image processing device 1 reads the format 39 set as a format to be applied from the storage device 36 (step S221). Then, the image processing device 1 extracts character strings based on the common positional conditions defined in the format 39 as read (step S222). Here, character strings satisfying common positional condition information defined for each hierarchical level of the format 39 are all extracted from the document data 40.

The image processing device 1 thereafter extracts character strings based on the common keywords defined in the format 39 (step S223). Here, character strings satisfying common keyword information defined for each hierarchical level of the format 39 are all extracted from the document data 40.

The image processing device 1 thereafter extracts character strings based on the common numerical information defined in the format 39 (step S224). Here, character strings satisfying common numerical information defined for each hierarchical level of the format 39 are all extracted from the document data 40.

The image processing device 1 further extracts character strings based on the common attributes defined in the format 39 (step S225). Here, character strings satisfying common attribute information defined for each hierarchical level of the format 39 are all extracted from the document data 40.

As a result of the above-described processing in steps S222 to S225, character strings satisfying at least one of the common positional condition information, common keyword information, common numerical information, and common attribute information defined in the format 39 read in step S221 are all extracted. Next, in order to narrow down the extracted character strings to bookmark candidates, the image processing device 1 further extracts those character strings that satisfy all types of features defined in the format 39 as bookmark candidates (step S226). More specifically, only character strings that satisfy all of the common positional condition information, common keyword information, common numerical information, and common attribute information defined in the format 39 read in step S221 are extracted for each hierarchical level. Only the character strings extracted here become bookmark candidates.
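The two-stage narrowing in steps S222 to S226 can be sketched as below: a string is first collected when it satisfies at least one condition of a level, and is kept as a bookmark candidate only when it satisfies all of them. Representing each condition as a predicate function, and the names extract_candidates, indent, dots, and font, are assumptions for illustration.

```python
def extract_candidates(strings, fmt):
    """Return (level, text) pairs for strings satisfying every condition of a level.

    strings: list of dicts describing character strings in the document data.
    fmt: dict mapping level -> list of predicate functions, one per feature type.
    """
    result = []
    for level, conditions in sorted(fmt.items()):
        # Steps S222 to S225: strings satisfying at least one condition.
        partial = [s for s in strings if any(cond(s) for cond in conditions)]
        # Step S226: keep only strings satisfying all conditions of the level.
        full = [s for s in partial if all(cond(s) for cond in conditions)]
        result.extend((level, s["text"]) for s in full)
    return result

strings = [
    {"text": "Chapter 1 Overview", "indent": 10, "dots": 0, "font": "Gothic"},
    {"text": "Chapter 1 is summarized here.", "indent": 25, "dots": 0, "font": "Mincho"},
    {"text": "1.1 Scope", "indent": 20, "dots": 1, "font": "Gothic"},
]

# Hypothetical format with four conditions (position, keyword, number, attribute) per level.
fmt = {
    1: [lambda s: s["indent"] <= 10,
        lambda s: s["text"].startswith("Chapter"),
        lambda s: s["dots"] == 0,
        lambda s: s["font"] == "Gothic"],
    2: [lambda s: 10 < s["indent"] <= 20,
        lambda s: s["text"][0].isdigit(),
        lambda s: s["dots"] == 1,
        lambda s: s["font"] == "Gothic"],
}
bookmark_candidates = extract_candidates(strings, fmt)
```

The body sentence beginning with "Chapter 1" satisfies the keyword and numerical conditions of level 1 but not the positional or attribute conditions, so it is dropped in the second stage.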

Next, the image processing device 1 associates each of the character strings extracted as bookmark candidates with the hierarchical structure of the format 39 (step S227), thereby arranging the character strings extracted as bookmark candidates in the hierarchical structure. Finally, the image processing device 1 links the character strings extracted as bookmark candidates to the document data 40 (step S228). This linking processing is the same as that performed in step S209.

Returning to FIG. 12, after extracting the character strings as bookmark candidates in the character string extracting processing (step S107), the image processing device 1 displays a result of the character string extraction on the display device 20 (step S108). At this time, the screen shown in FIG. 7 is displayed on the display device 20, for example, and the detail of the format 39 responsive to the character string extracting processing is displayed in the bookmark format display field 51. More specifically, the detail of the format 39 created in step S208 of FIG. 13 is displayed when the character strings are extracted in the first way. The detail of the format 39 read in step S221 of FIG. 13 is displayed when the character strings are extracted in the second way.

The user checks the displayed result of the character string extraction, and proceeds to an editing operation with the mouse 21 and the like to correct the format 39 when the result of the character string extraction fails to satisfy the user's desire. In contrast, the user does not proceed to the editing operation of the format 39 when the result of the character string extraction satisfies the user's desire.

After displaying the result of the character string extraction, the image processing device 1 determines whether or not the user has made the editing operation of the format 39 (step S109). When the user has made the editing operation (when a result of step S109 is YES), the image processing device 1 proceeds to a format editing processing (step S110).

FIG. 14 is a flow diagram explaining in detail an exemplary procedure of the format editing processing (step S110). First, the image processing device 1 analyzes the editing operation made by the user (step S300).

When the editing operation is intended to correct the common positional condition in the format 39 (when a result of step S301 is YES), the image processing device 1 corrects the common positional condition information defined in the format 39 (step S302) according to the user's operation. In contrast, the image processing device 1 bypasses step S302 when the editing operation is not intended to correct the common positional condition in the format 39 (when a result of step S301 is NO).

When the editing operation is intended to correct the common keyword in the format 39 (when a result of step S303 is YES), the image processing device 1 corrects the common keyword information defined in the format 39 (step S304) according to the user's operation. In contrast, the image processing device 1 bypasses step S304 when the editing operation is not intended to correct the common keyword in the format 39 (when a result of step S303 is NO).

When the editing operation is intended to correct the common numerical information in the format 39 (when a result of step S305 is YES), the image processing device 1 corrects the common numerical information defined in the format 39 (step S306) according to the user's operation. In contrast, the image processing device 1 bypasses step S306 when the editing operation is not intended to correct the common numerical information in the format 39 (when a result of step S305 is NO).

When the editing operation is intended to correct the common attribute in the format 39 (when a result of step S307 is YES), the image processing device 1 corrects the common attribute information defined in the format 39 (step S308) according to the user's operation. In contrast, the image processing device 1 bypasses step S308 when the editing operation is not intended to correct the common attribute in the format 39 (when a result of step S307 is NO).

After performing the above-described editing processing of the format 39 in response to the user's editing operation, the image processing device 1 sets the corrected format 39 as another format 39 (step S309) to be applied for subsequent character string extraction.
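The format editing processing of steps S300 to S309 can be sketched as a sequence of four checks, each correcting one type of information only when the user's operation targets it. The dictionary layout of the format and of the editing operation, and the name edit_format, are hypothetical choices for illustration.

```python
import copy

# The four types of information held per hierarchical level (assumed key names).
FEATURES = ("position", "keyword", "numbering", "attribute")

def edit_format(fmt, operation):
    """Return a corrected copy of fmt according to the user's editing operation.

    fmt: dict mapping level -> dict of feature name -> value.
    operation: dict with the target "level" and a "changes" dict.
    """
    corrected = copy.deepcopy(fmt)          # the original format is left untouched
    level = operation["level"]
    for feature in FEATURES:                # checks of steps S301, S303, S305, S307
        if feature in operation["changes"]:
            # corrections of steps S302, S304, S306, S308
            corrected[level][feature] = operation["changes"][feature]
    return corrected                        # step S309: set as the format to be applied

original = {
    1: {"position": (0, 10), "keyword": "Chapter", "numbering": 0, "attribute": "Gothic"},
    2: {"position": (11, 20), "keyword": None, "numbering": 1, "attribute": "Gothic"},
}
operation = {"level": 2, "changes": {"position": (11, 25), "attribute": "Mincho"}}
corrected = edit_format(original, operation)
```

Returning a copy rather than modifying the format in place is a design choice of this sketch; it keeps the previously applied format available alongside the corrected one.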

Then, the flow returns to the character string extracting processing (step S107) as shown in FIG. 12 in which character strings as bookmark candidates are extracted again based on the corrected format 39. This extraction of character strings (step S107) is realized by following the character string extracting processing in the second way shown in steps S221 to S228 of FIG. 13. After the character strings are extracted again, the display screen on the display device 20 is updated (step S108).

When the editing operation of the format 39 is not made by the user (when a result of step S109 is NO), the image processing device 1 generates bookmark data of the hierarchical structure based on the character strings extracted in the character string extracting processing (step S107), and adds the bookmark data to the document data 40 (step S111). The bookmark data can therefore be referred to when the document data 40 is used, which enhances the convenience in the use of the document data 40.
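The generation of hierarchical bookmark data in step S111 can be illustrated with the following sketch, which nests a flat list of extracted character strings according to their hierarchical levels. The tuple layout (level, title, page), the use of a page number as the positional information, and the name build_bookmarks are assumptions; the actual bookmark data depends on the output file format.

```python
def build_bookmarks(entries):
    """Nest (level, title, page) entries, given in document order, into a tree."""
    root = {"children": []}
    stack = [(0, root)]  # (level, node); the root sits at level 0 and is never popped
    for level, title, page in entries:
        node = {"title": title, "page": page, "children": []}
        # Climb back up until the top of the stack is a shallower level.
        while stack[-1][0] >= level:
            stack.pop()
        stack[-1][1]["children"].append(node)
        stack.append((level, node))
    return root["children"]

entries = [
    (1, "1. Introduction", 1),
    (2, "1.1 Background", 1),
    (2, "1.2 Scope", 2),
    (1, "2. Method", 3),
]
bookmarks = build_bookmarks(entries)
```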

The image processing device 1 thereafter saves the finally fixed format 39 to the storage device 36 (step S112). At this time, the image processing device 1 enters, in the format management table 39a, information of the logged-in user in association with the format 39, and information of the attribute of the document data 40 in association with the format 39.
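The association of a saved format with a user and with a document attribute in step S112 can be sketched as a simple table. The row layout, the identifiers such as "fmt-1", and the function names register_format and find_formats are hypothetical; the structure of the format management table is not detailed in this passage.

```python
def register_format(table, format_id, user, doc_attribute):
    """Add one row associating a format with a user and a document attribute."""
    table.append({"format_id": format_id, "user": user, "doc_attribute": doc_attribute})

def find_formats(table, user=None, doc_attribute=None):
    """Return the ids of formats matching the given user and/or document attribute."""
    return [
        row["format_id"]
        for row in table
        if (user is None or row["user"] == user)
        and (doc_attribute is None or row["doc_attribute"] == doc_attribute)
    ]

table = []
register_format(table, "fmt-1", "alice", "report")
register_format(table, "fmt-2", "bob", "report")
register_format(table, "fmt-3", "alice", "manual")

alice_formats = find_formats(table, user="alice")
report_formats = find_formats(table, doc_attribute="report")
```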

As described, in the preferred embodiment, the image processing device 1 functions as a document processing device. So, character strings such as headings are extracted as bookmark candidates from the document data 40 as a target of bookmark extraction. In this extraction, the image processing device 1 derives features shared in common among the character strings extracted as bookmark candidates, and creates a format 39 containing the common features in the form of data. The image processing device 1 also displays the character strings extracted as bookmark candidates in a list form, together with the format 39 in the form of data, on the display device 20, and receives a user's operation for correction of the format 39. The image processing device 1 corrects the format 39 based on the user's operation for correction, and extracts character strings again to conform to the corrected format 39.

Even when a character string a user intended is not extracted as a bookmark candidate, the user makes an operation to correct the common positional condition information, the common keyword information, the common numerical information, or the common attribute information in each hierarchical level while viewing the detail of a format 39 displayed on the display device 20. So, character strings to be bookmark candidates are extracted again in response to the user's operation, by which a result of character string extraction displayed on the display device 20 is updated. This simplifies the user's operation to correct the result of character string extraction as compared to that in the conventional technique. As a result, character strings as the user intended are efficiently extracted as a bookmark.

In the preferred embodiment, a format 39 created during the course of the character string extracting processing is stored in the storage device 36. So, when selecting different documents and trying to extract the respective bookmarks therefrom, a user sets the format 39 previously applied as a format to be applied each time for subsequent character string extraction, and then provides instructions for the character string extracting processing in the second way. This reduces the number of user's operations for correction.

So, according to the document processing device described as an example of the preferred embodiment, character strings satisfying a predetermined condition for character string extraction are extracted from document data, the respective features of the character strings are derived, and a format containing the features in the form of data is created. The extracted character strings are displayed in a list form together with the format that is created based on the respective features of the character strings. The correction made to the format is received, and performed. Character strings are extracted again to conform to the corrected format. Even when a character string failing to satisfy a user is extracted as a bookmark candidate, the user makes an operation accordingly to correct the format containing the respective features of the character strings. So, character strings are extracted again to conform to the corrected format, by which character strings satisfying the user are efficiently extracted.

While the preferred embodiment of the present invention has been described above, the present invention is not intended to be limited to the details shown above. Numerous modifications and variations can be devised without departing from the scope of the invention.

As an example, in the preferred embodiment described above, the format 39 is created by deriving common features for bookmark candidate character strings extracted in the first way. After creation of the format 39, bookmark candidate character strings are extracted in the second way with the created format 39. In other words, the first way is only for extracting bookmark candidate character strings from the document data 40. So, the substance of the first way is not intended to be limited to that described above.

By way of example, the image processing device 1 includes, in addition to the operational panel 17, the display device 20 on which information such as results of character string extraction are displayed. The reason to provide the display device 20 is that the display unit 15 of the operational panel 17 is a low-resolution display, and is considered to be unsuitable for properly displaying information such as a preview image of a document based on document data 40 and the detail of a format 39. So, when the display unit 15 of the operational panel 17 has a satisfactory display resolution, information such as a preview image of a document based on document data 40 and the detail of a format 39 may be displayed on the display unit 15 without providing the display device 20.

In the preferred embodiment described above, common features to be derived from character strings extracted as bookmark candidates are those relating to the four types of features including the common positional condition information, common keyword information, common numerical information, and common attribute information. However, features of other types may be derived.

In the preferred embodiment described above, the image processing device 1 mainly functions as a document processing device. In order for the computer 3 to function as a document processing device alternatively, the program 37 for executing the procedure explained in the flow diagram of FIG. 12 may be installed on the computer 3 to cause the computer 3 to execute the procedure based on FIG. 12. In this case, the program 37 may be provided in the form of data stored in a recording medium such as a CD-ROM.

1. A document processing device comprising: a document data acquiring part for acquiring document data; a character string extracting part for extracting character strings satisfying a predetermined condition for character string extraction from said document data acquired by said document data acquiring part; a format creating part for deriving the respective features of said character strings extracted by said character string extracting part, and creating a format containing the derived features in the form of data; a display part on which said character strings extracted by said character string extracting part are displayed in a list form, and on which the format created by said format creating part is displayed; and a format correcting part for correcting the format, wherein said character string extracting part extracts character strings again to conform to the format corrected by said format correcting part.
2. The document processing device according to claim 1, wherein said format creating part arranges the respective features of character strings in a hierarchical structure based on features shared in common among said character strings; and said character strings extracted by said character string extracting part are associated with said hierarchical structure and displayed on said display part.
3. The document processing device according to claim 1, wherein the features of each of said character strings derived by said format creating part include a positional condition indicating a range of the character string from a reference position in a document.
4. The document processing device according to claim 1, wherein the features of each of said character strings derived by said format creating part include a keyword contained in the character string.
5. The document processing device according to claim 1, wherein the features of each of said character strings derived by said format creating part include numerical information contained in the character string.
6. The document processing device according to claim 1, wherein the features of each of said character strings derived by said format creating part include the attribute of the character string.
7. The document processing device according to claim 1, further comprising a storage part for storing therein the format created by said format creating part, or the format corrected by said format correcting part, wherein said character string extracting part reads the format from said storage part, and extracts character strings from document data to conform to the read format.
8. The document processing device according to claim 7, wherein said storage part stores therein the format associated with each user, and said character string extracting part reads the format associated with a user who orders extraction of character strings from said storage part, and extracts character strings from document data to conform to the read format.
9. The document processing device according to claim 7, wherein the format stored in said storage part is associated with the attribute of document data, and said character string extracting part reads the format from said storage part based on the attribute of said document data acquired by said document data acquiring part, and extracts character strings from document data to conform to the read format.
10. The document processing device according to claim 3, wherein said format correcting part receives an operation to correct a positional condition indicating a range from the reference position in a document displayed on said display part, and corrects information contained in the format based on said operation, and said character string extracting part extracts character strings again to conform to said information in the format corrected by said format correcting part.
11. The document processing device according to claim 1, wherein a preview image of a document based on said document data acquired by said document data acquiring part is also displayed on said display part, and each of said character strings in the list form and each character string contained in said preview image are displayed as being linked to each other on said display part.
12. The document processing device according to claim 1, wherein said document data acquiring part includes a character recognizing part for converting received image data to document data in text format by performing a character recognizing processing on said received image data.
13. A document processing method, comprising the steps of: (a) acquiring document data; (b) extracting character strings satisfying a predetermined condition for character string extraction from the acquired document data; (c) deriving the respective features of the extracted character strings, and creating a format containing the derived features in the form of data; (d) displaying said extracted character strings in a list form, and displaying the format; (e) correcting the format; and (f) when the format is corrected, extracting character strings again to conform to the corrected format.
14. The document processing method according to claim 13, wherein in said step (c), the respective features of character strings are arranged in a hierarchical structure based on features shared in common among said character strings, and in said step (d), said character strings extracted in said step (b) are associated with said hierarchical structure and displayed.
15. The document processing method according to claim 13, wherein the features of each of said character strings derived in said step (c) include a positional condition indicating a range of the character string from a reference position in a document.
16. The document processing method according to claim 13, wherein the features of each of said character strings derived in said step (c) include a keyword contained in the character string.
17. The document processing method according to claim 13, wherein the features of each of said character strings derived in said step (c) include numerical information contained in the character string.
18. The document processing method according to claim 13, wherein the features of each of said character strings derived in said step (c) include an attribute contained in the character string.
19. The document processing method according to claim 13, further comprising the step of: (g) storing the format created in said step (c), or the format corrected in said step (e), wherein in said step (b), the format stored in said step (g) is read, and character strings are extracted from document data to conform to the read format.
20. A computer-readable recording medium on which a program is recorded, said program causing a computer to operate as a system comprising: a document data acquiring part for acquiring document data; a character string extracting part for extracting character strings satisfying a predetermined condition for character string extraction from said document data acquired by said document data acquiring part; a format creating part for deriving the respective features of said character strings extracted by said character string extracting part, and for creating a format containing the derived features in the form of data; a display part on which said character strings extracted by said character string extracting part are displayed in a list form, and on which the format created by the format creating part is displayed; and a format correcting part for correcting said format, said program causing said character string extracting part to extract character strings again to conform to the format corrected by said format correcting part.